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OVERLAY ERROR MODEL, SAMPLING STRATEGY AND ASSOCIATED 
EQUIPMENT FOR IMPLEMENTATION 

CROSS-REFERENCE TO RELATED APPLICATION 
This application claims the priority benefit of Taiwan application serial no. 
90104309, filed on February 26, 2001. 

BACKGROUND OF THE INVENTION 

Field of Invention 

The present invention relates to a method for manufacturing semiconductor 
devices and the associated equipment required by the method. More particularly, the 
present invention relates to an overlay error model, a sampling strategy and associated 
equipment for implementation. 
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15 Description of Related Art 

Microlithography is an important technique closely related to the manufacturing 
of semiconductor wafers. Conventional microlithographic stations using a scanning 
projection method for manufacturing very-large-scale-integrated (VLSI) circuits have 
now been largely replaced by step-and-repeat systems. To produce semiconductor 

20 devices capable of operating normally, the overlays for patterning out various layers 
must be designed within an acceptable tolerance level. 

High resolution and alignment are important factors for producing quality 
devices on a silicon wafer during photographic exposure. Fig. 1 is a sketch of a 
conventional dual alignment system used by semiconductor manufacturers. In the dual 
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alignment system 10, an alignment detector emits a light beam 12. The light beam 12 
travels through a reticle 13 and a protection lens 14 to arrive at an alignment mark 16 on 
a wafer 15 so that the wafer 15 is aligned. After wafer 15 alignment, various pattern 
layers above the wafer 15 are exposed. Therefore, controlling the alignment accuracy 
of various overlapping overlays is important. To improve production yield, overlay 
errors due to misalignment during photo-exposure using a microlithographic machine 
must be reduced to a tolerable level. 

The following is a list of commonly used symbols in error determination and 
their respective meanings; 

(x, y): an intra-field coordinate system relative to the center of an exposure field 
and related to photomask factors; 

(X, Y); an inter-field coordinate system relative to the center of a wafer and 
related to wafer factors; 

d x , dy'. intra-field overlay error relative to intra-field coordinate system (x, y); 

d x , d Y ; inter-field overlay error relative to inter-field coordinate system (X, Y); 

d x+x ; the sum of intra-field overlay error along horizontal axis x and inter-field 
overlay error along horizontal axis X; 

d y+Y ; the sum of intra-field overlay error along vertical axis y and inter-field 
overlay error along vertical axis Y; 

T x? T y : intra-field translation error; 

T x? T Y : inter-field translation error; 

T x+X ; the sum of intra-field horizontal translation and inter-field horizontal 
translation; 

T y+Y ; the sum of intra-field vertical translation and inter-field vertical translation; 
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E x+Y : inter-field expansion error; 
M x , My: intra-field magnification error; 
Ry! intra-field rotation error; 



R x , R Y : inter-field rotation error; 



rx, ry; residual error after intra-field compensation, indicating random error; and 

r x , r Y : residual error after inter-field compensation, indicating random error. 

The appearance of overlay errors can be attributed to intra-field and inter-field 
errors. A 'field' refers to an exposure field on a wafer each time machine-alignment is 
required. The ' intra-field overlay error' refers to the problem caused by a 
misalignment between a light source filter lens and a mask. Therefore, intra-field 
overlay error is measured using the center point of a sampled exposure field as the 
origin. In addition, the so-called 'inter-field overlay error 5 refers to the problem 
caused by a misalignment between the mask and the wafer. Hence, the inter-field 
overlay error is measured using the center point of the wafer as the origin. 

Fig. 2 is a diagram showing a method of measuring overlay error. As shown in 
Fig. 2(a), the overlay error is the displacement error [7] (displacement error is 
represented by d) between an image exposure field 21 and a previous image exposure 
field 22. The values listed out as xl, x2, yl and y2 are the actually measured 
parameters of overlay errors representing the relative displacement of two different size 
rectangular exposure fields. As shown in Fig. 2(b), the response variables of the 
overlay errors can be indicated as follows: 
xl-x2 

2 (1) 
2 (2) 
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where d x+x represents the sum of intra-field overlay error along horizontal axis x 
and inter-field overlay error along horizontal axis X and cL +Y represents the sum of intra- 
field overlay error along vertical axis y and inter-field overlay error along vertical axis 
Y. When d x+x = 0, this indicates that the small rectangle is located in exactly the center 
5 of the large rectangle horizontally. Hence, overlay error in the x direction is zero. 
Similarly, when d y+Y = 0, this indicates the small rectangle is located in exactly the 
center of the large rectangle vertically. Hence, overlay error in the y direction is zero. 
If d x+x = 0 and d y+Y = 0 at the same time, this indicates that the two consecutive image 
exposures produce no overlay errors. 

0 Fig. 3 is a sketch showing a sampling method for inter-field overlay and intra- 
field overlay on a wafer. The central point of an exposure field 3 1 on the wafer 3 5 has 
a coordinate (X, Y) relative to an origin at the center of wafer 35. Exposure fields at 
different locations are also selected accordingly. This particular mode of point 
selection is referred to as an inter-field sampling. An overlay point 33 surrounding the 

1 exposure field 31 may be selected. The overlay point 33 has a coordinate (x, y) 
relative to an origin at the center of the exposure field 31. This particular mode of 
point selection is referred to as an intra-field sampling. 

Due to an increase in threshold dimension and a reduction of feature size in 
VLSI circuits, factors affecting overlay errors have become increasingly complicated. 
Table 1 is an outline of all the factors that may produce overlay errors according to the 
investigation by Schmidt [13]. The principle factors include system problems, stepper 
problems, reticle accuracy, linewidth variation and wafer distortion. 



Table 1 : Major factors leading to overlay errors 



Source of Problem 


Overlay Errors 


System 


Vibration 
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Temperature 


Stepper 


Alignment Error 

Lens Distortion (Optical Alignment) 
Wafer Clamping 
Water 1 able Errors 


Reticle Accuracy 


Reticle In-plane Distortion 

raiiQTli r^laCemclll JQliui 

Reticle Clamping 


Linewidth Error 


Wafer (exposure, development, etching) 
Reticle (exposure, development, etching) 


Wafer Distortion 


Flatness/Curvature 
Pattern Movement/Slip 



The factors leading to intra-field overlay errors and inter-field overlay errors are 
presented in Figs. 4 and 5. Fig. 4 is a sketch showing the factors related to intra-field 
overlay errors. Fig. 5 is a sketch showing the factors related to inter-field overlay 
errors. Mask-related intra-field overlay errors include translation, rotation and 
magnification while wafer-related inter-field overlay errors include expansion and 
rotation. 

An overlay error model having six parameters has previously been developed by 
Perloff [10]. In this model, errors contributed by translation (T x? T Y ), rotation (R x , R Y ) 
and wafer expansion (E^ E Y ) are added together to produce a linear sum for describing 
inter-field overlay errors. The following formulas are used. 

d x = T x - R X Y + E X X ^ 

dy — Ty — RyX + EyY 

However, factors related to stage motion are absent in Perloff s model. 

In Hasan et al. [6], registration data and specially designed control structures are 
incorporated to obtain error data. The data is used to estimate the accuracy of stage- 
stepping in (x, y) coordinates and perform necessary rectification accordingly. 
McMillen and Ryden [8] introduce step-and-repeat system, thereby expanding the 
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model used by Hasan [6] to develop an image field placement deviation (IFPD) model. 
The IFPD model takes into account intra-field errors such as translation (T x , T y ) ? 
rotation (R^, unequal edges (t 1? t 2 ), magnification (M x , M,) and 3 -dimensional 
distortion (E). The errors are described by the following formulas. 

5 d x = T x +M x - R x y + t x x 2 + t 2 xy - E(x 3 + xy 2 ) (5) 

d y = T y + R y x + M y y + t x xy + t 2 y 2 - E(yx 2 + y 3 ) (6) 

In addition, Peski [11] has also developed a technique that utilizes the wafer-to- 
reticle aligning capacity of a wafer stepper to carry out precision control of the wafer 
exposure equipment. Peski incorporates stepper parameters into the design. The 
10 stepper parameters include an X marking factor, a Y marking factor, an X positive 
crossing and a Y positive crossing. The following formulas are used. 
d x = S x [int(C I 5) - 1.5][0.45] + M[C - 5{int(C / 5)} - 2][0.09] 

+ [$ina][R - 5{int(i? / 5)} - 2][0.09] + [sin b[R - 9.5][0.09] ^ 

d y = S y [int(R I 5) - 15][0.45] + M[R- 5{int(7?/ 5)} - 2][0.09] 
15 + [sin a][C - 5{int(C / 5)} - 2][0.09] ^ 

where 

Sx: X marking factor; 
Sy: Y marking factor; 
C: 0 - 19 columns; 
20 R: 0 - 19 rows; 

M: Magnification; 

a: die rotation angle; 

b: array positive crossing angle; 
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int: integral sign 

Arnold [3] discovers that a principle factor in misalignment is the distortion of 
lines parallel to the x-axis or the y-axis by a scanning stepper. Arnold makes due 
consideration to parameters including grid errors, lens errors and distortion along x, y 
direction. Moreover, lens errors caused by 3 -dimensional lens distortion are also 
considered. 

Brink [4] has developed an overlay model with due consideration of intra-field 
and inter-field effects. The intra-field model is based on the investigation by 
MacMillen and Ryden [8] with further incorporation of 5-dimensional distortion 
parameters. The following formulas are produced. 

d x = r x + M x x - R x y - Txx x 2 - r yx xy + W x y 2 + D 3x x(x 2 + y 2 ) 



r y + M y x -R y ~ t^x 2 - Tyy xy + W y x 2 + D 3y y(x 2 + y 2 ) 



+ D 5y y(x 2 +y 2 ) 2 +r y 

where 

XX 5 . 



T yy> T 



: intra-field slant coefficients; 
yx : inter-field slant coefficients; 



W * : clamping distortion in the x direction; 
W 

y : clamping distortion in the y direction; 

A y . 3 -dimensional distortion coefficients; and 
Ax? D sy . 5-dimensional distortion coefficients. 



(9) 



(10) 
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The inter-field model of Brink et al. is a combination of the model by Perloff [10] 
and the one by Arnold [3] with regard to the distortion parameters. The following 
formulas are derived. 

d x =T x + E X X - R X Y + B x Y 2 +r x (u) 

^ Cly = Ty +EyY—RyJC + ByJf 2 ^Ty ( 1 2) 

In addition, Lin and Wu [7] combine the intra-field error and the inter-field error 
in both the x direction and the y direction to produce the following formulas. 

d x+x = T X -R X Y + M X X + B X Y 2 + T x - R x + M x X - T^X 2 
- T^XY + W x y 2 + D 3x x(x 2 + y 2 ) + D 5x x(x 2 + y 2 f 
10 +D lx x(x 2 +y 2 y+r x (13) 

d y+Y = T r -R y X + M y y + B y X 2 +T y + R y x + M y y - T^xy 
-T^+Wx'+D^yix'+y^ + D.vix'+y 2 ) 2 



+ D 7y y(x 2 +y 2 y + r y 



(14) 



where xy y represents step distortion coefficients. 
15 Table 2 lists all the parameters considered in major overlay models. 
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Table 2: a listing of all the parameters considered in major overlay models 



Authors (year) 


Parameters in Overlay Model 


Perloff(1978) 


Translation 

Rotation 

Expansion 


MacMillen and Ryden (1982) 


Translation 
Rotation 
Expansion 
Unequal Lens Edges 
Lens Distortion 


Peski(1982) 


Marking Factors 
Array Crossings 
Lens Distortion 
Magnification Ratio 


Arnold (1983) 


Translation 
Rotation 
Expansion 
Unequal Lens Edges 
Lens Distortion 
Distortion Coefficients 


Brink et aL (1988) 


Translation 

Rotation 

Expansion 

Unequal Lens Edges 

3 -dimensional Lens Distortion 

5-dimensional Lens Distortion 

Distortion Coefficients 


Lin and Wu (1998) 


Translation 
Rotation 
Magnification 
Intra-field Slant Angle 
Clamping Distortion 
Multiple-Stage Distortion 



According to the aforementioned description, the theoretic model of 
conventional overlay error such as the ones shown in equations (13) and (14) has 
altogether 26 parameters. In real applications, such a system is complicated and costly 
to implement in a common manufacturing facility. Many investigations 

regarding the factors that may lead to overlay errors, their mathematical model as well 
as method of controlling the overlay errors have been conducted. However, 
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conventional models are rarely implemented in production. This is because the many 
parameters considered by overlay models often differ from the data measured by a 
typical correcting system. Therefore, the parameters in most theoretical models and 
the actual measured values must be compromised. In addition, the cost of making 
5 overlay measurements and the number of overlay samplings on a wafer are restricted. 
In general, more samples have to be taken if more parametric values are required. To 
boost overlay precision with a limited number of measurement, a good sampling 
strategy must be selected in addition to from finding a suitable overlay error model 

10 SUMMARY OF THE INVENTION 

Accordingly, one object of this invention is to provide an overlay model for 

measuring overlay errors. The overlay model is based on measurable and rectifiable 

parameters in a correcting system. According to the overlay model, a sampling 

procedure is also provided so that an optimal sampling strategy is found. Thereafter, 
15 according to the parameters and the model, various balancing acts and deductions are 

executed so that the overlay errors are compensated to within a tolerable range. 

Ultimately, overlay errors in microlithographic process are minimized. 

A second object of this invention is to provide a model for assessing overlay 

errors and sampling procedure so that a model having the best sampling strategy is 
20 found. Thereafter, the overlay errors are compensated so that precision of overlay 

pattern is improved. 

A third object of this invention is to provide a model and a sampling procedure 
for producing an overlay error measuring and compensating device. 
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To achieve these and other advantages and in accordance with the purpose of the 
invention, as embodied and broadly described herein, the invention provides a model for 
assessing overlay errors. The parameters in the model include inter-field translation, 
magnification, rotation, expansion and non-positive crossing as well as intra-field 
translation, magnification and rotation. These parameters are selected according to 
measurement parameters and statistics of an actual stepper machine. The integrative 
overlay error model uses the following formulas: 

d x + x = T x+X + E X X — (N + 6)7 + M> - R y y + r x+x ^ 

d y+Y = T^ r + E y Y~(9-N)X + M' y y + R y x + r y+y (24) 

In addition, this invention also provides a sampling procedure based on the 
aforementioned overlay model. The sampling strategy includes the following steps. 

In step (a), the state within the intra-field, that is, the number and positions of the 
selected overlays within the intra-field are determined. Each type of sampling state 
and sum of the overlays are then determined. Next, each type of intra-field overlay 
sampling state similarly on the wafer is ranked before the locations and actual overlay 
error values of each overlay type of the sampling states within the intra-field are 
measured. 

In step (b), the least square method is utilized using the measured values to 
obtain the coefficients according to the overlay error model formulas (23) and (24). 
Next, related square coefficient values between each type of overlay sampling states 
within the intra-field and the overlay model (or R-square represents the degree of the 
assessed variance that can be explained by the model) are computed. The highest 
related square coefficient among the overlay sampling states within the intra-field is 
selected so that model compensation is improved. 

12 
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In step (c), a plurality of inter-field overlay sampling patterns is determined. 
With the patterns, the locations and number of types of overlay sampling states within 
the inter-field are determined. And together with the selected highest square 
coefficient from the overlay states within the intra-field in step (b), the locations and 
actual overlay error values of each overlay sampling pattern within the inter-field are 
then measured. 

In step (d), the least square method is utilized with the measured values in step 
(c) to obtain the coefficients in the model formulas (23) and (24). Related square 
coefficient values between each overlay sampling pattern within the inter-field and the 
overlay model are computed. An inter-field overlay sampling pattern having the 
highest related square coefficients is selected, and thereby finding the optimal sampling 
pattern that corresponds to the model and improving the effect of model compensation. 

In step (e), the combination of intra-field overlay sampling state and inter-field 
overlay sampling pattern and its relationship with the model are determined so that the 
highest related coefficients are obtained. Then, compensation and modification using 
the measured overlay errors for the same stepper machine are performed, so that the 
number of overlay errors of the exposed patterns is reduced to a tolerable range and 
yield of semiconductor device is improved. 

Another object of this invention is to provide a device designed according to the 
aforementioned strategic sampling procedure. The device can be used inside a step- 
exposure system for reducing overlay errors of pattern on a wafer. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary, and are intended to provide further 
explanation of the invention as claimed. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings are included to provide a further understanding of 
the invention, and are incorporated in and constitute a part of this specification. The 
drawings illustrate embodiments of the invention and, together with the description, 
serve to explain the principles of the invention. In the drawings, 

Fig. 1 is a sketch of a conventional dual alignment system used by 
semiconductor manufacturers; 

Fig. 2 is a diagram showing a method of measuring overlay error; 

Fig. 3 is a sketch showing a sampling method for inter-field overlay and intra- 
field overlay on a wafer; 

Fig. 4 is a sketch showing the factors related to intra-field overlay errors; 

Fig. 5 is a sketch showing the factors related to inter-field overlay errors; 

Fig. 6 is an intra-field overlay sampling scheme according to one preferred 
embodiment of this invention; 

Fig. 7 is a sketch of the eight types of intra-field overlay position sampling states 
according to one embodiment of this invention; 

Fig. 8 is a sketch of an inter-field overlay sampling scheme according to one 
embodiment of this invention; 

Fig. 9 is a sketch of an inter-field overlay sampling pattern (i) according to one 
embodiment of this invention; 

Fig. 10 is a sketch of an inter-field overlay sampling pattern (ii) according to one 
embodiment of this invention; 

Fig. 1 1 is a sketch of an inter-field overlay sampling pattern (iii) according to 
one embodiment of this invention; 

14 
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Fig. 12 is a sketch of an inter-field overlay sampling pattern (iv) according to 
one embodiment of this invention; 

Fig. 13 is a sketch of an inter-field overlay sampling pattern (v) according to one 
embodiment of this invention; 

Fig. 14 is a sketch of an inter-field overlay sampling pattern (vi) according to 
one embodiment of this invention; 

Fig. 15 is a sketch of an inter-field overlay sampling pattern (vii) according to 
one embodiment of this invention; 

Fig. 16 is a sketch of an inter-field overlay sampling pattern (viii) according to 
one embodiment of this invention; 

Fig. 17 is a sketch of an inter-field overlay sampling pattern (ix) according to 
one embodiment of this invention; 

Fig. 18 is a sketch of an inter-field overlay sampling pattern (x) according to one 
embodiment of this invention; 

Fig. 19 is a graph showing the relationship between related square coefficient 
and the overlay sampling number according to the embodiment of this invention; 

Fig. 20 is a bar graph showing residual value between a conventional intra-field 
overlay sampling state (a) and the optimized intra-field overlay sampling state (d) 
according to this invention; and 

Fig. 2 1 is a graph analyzing the distribution of residual value in the intra-field 
overlay sampling state (d) according to this invention. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Reference will now be made in detail to the present preferred embodiments of 
the invention, examples of which are illustrated in the accompanying drawings. 
Wherever possible, the same reference numbers are used in the drawings and the 
5 description to refer to the same or like parts. 

In this invention, actual measurements by a stepping machine are analyzed to 
obtain a series of geometric properties of overlay errors such as expansion parameter (E) 
in the inter-field overlay error pattern. Typically, the overlay error increases towards 
the edges of a wafer. In addition, for rotation parameter (R), both inter-field and intra- 
10 field overlay error patterns include entirely different rotation pattern. 

Hence, for inter-field overlay errors, this invention utilizes actual measurements, 
simulations and statistics produced by the stepping machine to select four most 
important parameters including translation (T), magnification (M), rotation (R) and 
expansion (E). Ultimately, an inter-field overlay model is obtained having the 
15 following formulas: 

d x^T x +E x X-R x Y + r x (15) 

d Y = T Y + E Y Y — R Y X + r x ^ 

However, actual measurements and error compensations of the parameters R x 
and R Y in manufacturing are based on non-positive crossing (N) parameter and angular 

20 rotation ( # ) parameter. Hence, a variable transformation of the rotation parameter (R) 
is required. Assume, 

N= Rx ' Rr 

2 (17) 
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2 (18) 
the inter-field overlay model according to this invention uses the following formulas: 

d x = T x +E x X-(N + 0)Y + r x (]9) 

d Y = T r +E 7 Y-(e-N)X + r x (2Q) 

Similarly, for intra-field overlay errors, this invention utilizes actual 
measurements, simulations and statistics produced by the stepping machine to select 
three most important parameters including translation (T), magnification (M) and 
rotation (R). Ultimately, an intra-field overlay model is obtained having the following 
formulas: 

d x =T x +M:x-R x y + r x 



(21) 
(22) 

For a lens having a magnification M, the variable M * is defined to be 



d y = T y +M' y y + R y x+r y 



M' = AM = M* M x + i , M' • , . , „m 

*. In other words, iV **is a multiple of iKY * and the multiple is 

dependent upon the magnification of the lens. 

The inter-field overlay model formulated in equations (19) and (20) and the 
intra-field overlay model formulated in equations (21) and (22) are combined, thus 
producing the following integrative overlay error model formulas: 

< + x - T x+X + E X X - (N + 0)Y + M x x -R y y + r x+x 

d y+r = T y+r +E r Y-(0-N)X+ M' y y + R y x + r y+r 

The integrative overlay error model of this invention has several advantages. 
The total number of parameters considered is kept to a minimum so that the model can 
be easily implemented with minimal cost. Moreover, all the parameters used in this 
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invention can be measured in actual production process so that the errors can be 
compensated directly. Thus, measured data and set variables are linked together by the 
integrative overlay error model. 

After deciding upon the integrative overlay error model, the type of sampling 
5 strategy must be determined so that the parameters used in the model can be assessed 
and compensated. A strategic sampling procedure involves several stages. In the 
first stage, intra-field overlay states must be determined. In other words, the number 
and location of intra-field overlays are selected. In the second stage, the total number 
of sampled overlays on a wafer is determined so that the number and location of the 

10 intra-field overlay states on the wafer is set. In other words, the inter-field overlay 
pattern is determined. In the third stage, according to the model provided in this 
invention, related R-square values between different intra-field overlay states are 
estimated for identical positioning on the wafer. In the fourth stage, according to the 
model provided in this invention and under identical intra-field overlay states, in other 

15 words, based on the highest related R-square found in the third stage, related R-square 
values between different inter-field overlay patterns are estimated. Finally, in the fifth 
stage, based on the aforementioned results and actual wafer microlithographic process, 
the best overlay error sampling procedure is selected for compensating overlay errors. 
The following is a detailed description of the strategic sampling procedure. 

20 In stage one, intra-field overlay sampling states are determined. According to 

formulas (23) and (24), the overlay errors and the intra-field coordinates (x, y) are 
related. In actual measurement, the intra-field straight coordinate (x, y) is transformed 
into a polar coordinate (r, Fig. 6 is an intra-field overlay sampling scheme 

according to one preferred embodiment of this invention. In Fig. 6, selected peripheral 
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overlay regions (1) - (8) are at a radius r from a central region (9) and an included angle 

# forms between the radius vector and the x coordinate axis. The number of intra-field 
overlay samples to be used has been investigated in the past [12]. In production 
applications, each field selected on a wafer shown in Fig. 6 will pick up five of the nine 
overlays for measurements. The five overlays includes the central overlay (9) and four 
of the eight peripheral overlays (1) - (8). 

According to the above sampling scheme, altogether 70 types of intra-field 
overlay states are produced. Due to cost and yield considerations, not every one of the 
intra-field overlay states is selected. 

According to the intra-field overlay model formulas (21) and (22), parameters 
including translation (T), magnification (M) and rotation (R) are all related to intra-field 
overlay errors. The effect of translation on overlay is a constant. The magnification 
for the overlays (1) and (2) along the y- axis are identical. Similar properties exist for 
the overlay pairs (3) and (4), the overlay pairs (5) and (7) and the overlay pairs (6) and 
(8). In like manner, the magnification for the overlays (1) and (3) along the x- axis are 
identical. Similar properties exist for the overlay pairs (2) and (4), the overlay pairs (5) 
and (6) and the overlay pairs (7) and (8). For the rotation parameter, the larger the 
angular difference (2 1) between overlays, the more accurate the estimation for the 
parameters and This is because a larger increase in ^ ~ &\ ? even if one G f t k e 
selected region has minor rotation, leads the formulas (21) and (22) to develop into a 
greater difference. Hence, the parameters R x and Ry can be estimated more accurately. 

To test the above rules to determine the best intra-field overlay sampling state, 
the overlays in Fig. 6 are divided into four different pairings. The pairs include (1) and 
(5), (2) and (7), (3) and (6) and (4) and (8). An overlay is selected from each pair. In 
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theory, there are altogether 2x2x2x2 = 16 combinations. After careful selection, six 
representative states are chosen in the invention. Fig. 7 is a sketch of the eight types of 
intra-field overlay position sampling states according to one embodiment of this 
invention. As shown in Fig. 7, the states includes (a) to (f) and two sampling states 
(1)(5)(4)(8) and (2)(7)(3)(6), that is, (g) and (h). Thus, this invention provides 
altogether nine sampling states for analyzing intra-field overlay model errors. 

In stage two, inter-field overlay sampling patterns are determined. After the 
determination of intra-field overlay sampling states (including the number and location 
of overlays in the intra-field), the number of overlay samples from the wafer is selected. 
Hence, the distribution of number and location of intra-field overlay states on the wafer 
can be determined. In a word, inter-field overlay sampling patterns are determined. 
Fig. 8 is a sketch of an inter-field overlay sampling scheme according to one 
embodiment of this invention. To cany out inter-field sampling according to this 
invention, the inter-field straight coordinate (X, Y) is transformed into a polar 
coordinate^, 

Before carrying out inter-field overlay sampling, the number of overlays used in 
the wafer must first be determined. According to size and quality of production wafers, 
the number of wafer overlays selected includes 10, 20, 50, 100, 200 and 255. Using 
the determined inter-field overlay states as a base, a five-overlay selection is also 
included. However, if 255 overlays are chosen on the wafer, altogether 51 inter-field 
overlay states may be positioned across the wafer. Under the same token, a 200- 
overlay selection will produce 40 inter-field overlay states and 20-overlay selection will 
produce four inter-field overlay states. All these inter-field overlay states on the wafer 
are commonly referred to as 'inter-field overlay sampling patterns'. 
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In general, a greater number of overlays selected on the wafer yields more 
accurately measured overlay errors. However, cost of such measurements also 
increases proportionately. Therefore, increasing assessment accuracy without a 
corresponding increase in cost is a major issue in this invention. 

A comparison for different number of overlays on a wafer is described in a 
subsequent section. Here, the number of overlays used in this invention is the number 
likely to be chosen by wafer manufacturers such as 20. Figs. 9 - 18 are sketches of 
inter-field overlay sampling patterns according to the embodiment of this invention. 

In Fig. 8, a related method of selecting inter-field overlays is shown. Radius r 
is the distance from the center of the wafer to the center of a particular selected overlay. 
Angle # is an angular measurement whose pattern is similar to the one in Fig. 7. 
According to the formulas (19) and (20) of the inter-field overlay model, expansion 
parameters E^ and E Y are related to the coordinate (X, Y) such that E x is related to X 
while E Y is related to Y. Simulation of the expansion parameter patterns have 
indicated that overlay errors increase towards the outer edges of the wafer. Even if 
there is only minor expansion near the central region of the wafer, the outer region of 
the wafer may accumulate substantial overlay errors. Hence, overlays in the outer 
edge regions will have a greater radius r. Since overlay errors are greater in these 
outer regions, compensation is also much easier. Thus, the selection of overlays in the 
outer wafer region is better able to find the expansion parameters E x and E Y . 

To estimate the difference in inter-field overlay sampling, this invention 
provides an inter-field sampling strategy. The sampling strategy pays close attention 
to the radius r and the angle As shown in Figs. 9-18, the strategy includes ten 
different inter-field overlay sample patterns. The radius r is the distance between the 
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origin of the wafer to the selected region while the angle # is the rotation angle shown 
in Fig. 9. The inter-field overlay sampling location in each pattern is represented by 
state (d) in Fig. 7. 

In stage three, inter-field sampling states are compared. As shown in Fig. 7, 
5 eight different overlay location sample states are employed in this invention. Each 
sampling state has five overlays including four peripheral overlays and one central 
overlay. To compare different sampling states, different overlay numbers and related 
square coefficients (R-square values) of the model provided by this invention, a 
multiple linear regression analysis [5] is used to find a model similar to the one depicted 

10 by formulas (23) and (24). In addition, the least square method is used to estimate 
corresponding parametric and related R-square values. 

To obtain related inter-field overlay sample states, related R-square values of the 
aforementioned eight sampling states are compared in relationship with the different 
overlay sample numbers such as 255, 100, 50, 20 and 10. Because each selected 

15 region has five overlays, 51 regions (not shown) are produced if a decision to use 255 
overlays is chosen. Other overlay numbers may be selected in wafer sampling and 
compared. The related square coefficients resulting from a comparison between the 
inter-field sampling state and the sampling number are listed in Table 3. The sampling 
states in Table 3(1) are identical to state (a) in Fig. 7, which is the sampling states 

20 currently employed by wafer manufacturers. Due to cost consideration, an overlay 
sampling number of twenty is employed by most wafer manufacturers. 

Table 3: Related square coefficients between intra-field sampling states and 
sampling numbers 
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Overlay 

sample 

number 


Intxa-state 

sampling 

state 


(a) 


(b) 


(c) 


(d) 


(e) 


(f) 


(g) 


(h) 


255 


99.4% 


99.7% 


99.4% 


99.6% 


99.8% 


99.7% 


99.6% 


99.8% 


100 


97.2% 


94.7% 


96.1% 


98.1% 


99.3% 


95.2% 


93.1% 


95.4% 


50 


95.8% 


97.2% 


96.7% 


98.3% 


96.5% 


95.1% 


90.6% 


94.1% 


20 


87.1% 


96.8% 


92.1% 


96.5% 


91.0% 


88.5% 


76.7% 


81.1% 


10 


71.5% 


83.2% 


61.2% 


83.6% 


79.7% 


73.8% 


52.5% 


69.7% 



According to the results in Table 3, measured errors between different sampling 
states are minor when the number of sampling overlays is relatively large (for example, 
255 and 100). However, when the sampling number is relatively small (for example, 
10 and 20), significant differences in measured errors are observed. Considering the 
sampling number employed by most wafer manufacturers (typically twenty overlays), 
the sampling states (d) in Fig. 7 have a higher resemblance and a smaller residual 
parametric values when compared with conventional sampling state (state (a) in Fig. 7). 

The results in Table 3 also show that as the number of sampling overlays 
increases, higher R-square values are obtained. This implies that as the number of 
sampling increases, more data are obtained so that better error compensation is possible. 
Fig. 19 is a graph showing the relationship between related square coefficients and the 
overlay sampling numbers according to the embodiment of this invention. The graph 
is based on column (d) of Table 3. Hence, a strategic planner with consideration only 
of inter-field factors may decide the number of sampling overlays (sampling cost) and 
the resemblance of the overlay error model (level of related square coefficients) through 
the data presented in Fig. 19. In other words, this invention provides a systematic 
decision method that can be built into related equipment. 

In the fourth stage, inter-field sampling patterns are compared. According to 
Figs. 9-19, ten different sampling patterns are provided. Each sampling pattern has 
four selected regions based on radius of the wafer and rotated angle. Each region 
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further includes five overlaying points. Therefore, each type of sampling pattern has 
altogether twenty overlays, which are based on the sampling number of most current 
wafer manufacturers. To compare various inter-field sampling patterns, this invention 
also employs a multiple linear regression analysis to derive the formulas (23) and (24) 
provided by the model. In addition, a least square method is used to estimate 
corresponding parametric values and related square coefficients between different 
sampling patterns. 

The results of comparing related square coefficients between inter-field 
sampling patterns and intra-field sampling states are shown in Table 4 below. The 
sampling pattern (i) is the current sampling method employed by current wafer 
manufacturers. 



Table 4: Parametric values of inter-field sampling patterns and related R-square 

values 



Param- 
etric value 
and 

related R- 

square 

value 


Inter-field 

sampling 

pattern 


(i) 


(ii) 


(iii) 


(iv) 


(v) 


(vi) 


T x (um) 


-0.416 


-0.157 


-0.144 


-0.172 


-0.20 


-0.160 


TyGihi) 


-0.122 


-0.134 


-0.119 


-0.117 


-0.09 


-0.135 


^(urad) 


0.019 


0.033 


0.035 


0.033 


0.029 


0.005 


Ex(ppm) 


0.12 


0.073 


0.092 


0.067 


-0.008 


0.077 


Ey(ppm) 


0.16 


0.29 


0.21 


0.05 


0.012 


-0.012 


N(^rad) 


0.008 


0.023 


0.011 


0.002 


0.004 


0.07 


Related R-square 


94.1% 


87.2% 


91.3% 


86.5% 


72.5% 


80.1% 


Error Norm 


0.442 


0.691 


0.532 


0.897 


0.889 


0.801 
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Table 4: Continuation 



Param- 


Inter-field 


(vii) 


(viii) 


(ix) 


(x) 


True 


etric value 


sampling 










Value 


and 

related R- 


pattern 












square 
value 














T x (Mm) 


-0.177 


-0.156 


-0.181 


-0.164 


-0.167 


T Y (nm) 


-0.123 


-0.137 


-0.128 


-0.132 


-0.128 


^(Mxad) 


0.009 


0.017 


0.013 


0.019 


0.021 


E x (ppm) 


0.078 


0.008 


0.092 


0.087 


-0.094 


E v (ppm) 


0.035 


0.123 


0.044 


0.21 


0.19 


N(^rad) 


0.005 


0.012 


0.012 


0.005 


0.005 


Related R-square 


88.3% 


95.6% 


86.6% 


96.7% 




Error Norm 


0.712 


0.423 


0.884 


0.401 





According to the results shown in Table 4, the following conclusions can be 
drawn: (1) if the number of sample is restricted, a larger sampling point radius and a 
more symmetrical sampling pattern produce a higher related R-square value; (2) if 
rotational errors are present, inter-field sampling pattern (i) is much better at estimating 
an accurate rotation angle relative to other sampling patterns; and (3) if expansion errors 
are present, inter-field sampling pattern (iii) is much better at estimating an accurate 
rotation angle relative to other sampling patterns. Hence, strategic planners can select 
the sampling pattern for the wafer and the degree of resemblance of the overlay error 
model (magnitude of R-square values) based on the data shown in Table 4. 

In stage five, sampling strategies are integrated. To measure overlay error 
precisely during in-process manufacturing, inter-field sampling pattern (a wafer factor) 
must be considered aside from intra-field overlay states (a masking factor). 

As shown in Figs. 7 and Figs. 9-18, there are altogether eight types of intra- 
field overlay sampling states. Each sampling state has 10 inter-field sampling patterns. 
Hence, a complete sampling includes 80 combinations. Table 5 uses the formulas (23) 
and (24) to estimate and compare the aforementioned 80 types of combination and 
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10 



produces related square coefficients. Therefore, an effective and high resemblance 
sampling strategy that fits the model of this invention is generated. 

According to the results in Table 5, the intra-field sampling states (d) and (e) 
have high R-square values. Hence, no matter what type of inter-field sampling pattern 
is chosen, the intra-field sampling states (d) and (e) have optimal square coefficients. 
Compared with Table 4, radius is larger, sampling pattern is more symmetrical and 
related square coefficients are higher. Thus, the intra-field sampling state (d) and the 
inter-field sampling pattern (viii) and (x) combination produce higher square 
coefficients. 

Table 5: Related square coefficients of intra-field sampling state and inter-field 
sampling pattern 



Inter-field 

sampling 

pattern 


Intra-state 

sampling 

state 


(a) 


(b) 


(c) 


(d) 


(e) 




(g) 


(h) 


(i) 


90.1% 


91.3% 


89.7% 


94.1% 


92.3 
% 


91.2% 


82.55% 


86.4% 


(ii) 


87.5% 


87.2% 


84.5% 


87.2% 


90.1 
% 


88.5% 


73.1% 


86.2% 


(iii) 


88.3% 


89.2% 


86.3% 


91.3% 


90.1 
% 


86.6% 


74.9% 


90.4% 


(iv) 


85.3% 


84.2% 


83.5% 


86.5% 


87.1 

% 


87.2% 


72.6% 


85.5% 


(v) 


72.3% 


71.5% 


73.6% 


72.5% 


72.3 
% 


79.7% 


61.2% 


67.8% 


(vi) 


75.5% 


76.8% 


77.2% 


80.1% 


81.3 

% 


76.4% 


70.0% 


69.5% 


(vii) 


89.1% 


88.2% 


84.5% 


88.3% 


87.2 
% 


88.2% 


80.1% 


81.3% 


(viii) 


92.3% 


90.0% 


91.5% 


95.6% 


96.2 
% 


91.4% 


82.3% 


84.4% 


(ix) 


87.1% 


82.3% 


82.3% 


86.6% 


85.5 

% 


81.1% 


82.9% 


75.5% 


(x) 


91.3% 


90.1% 


89.7% 


96.5% 


93.1 

% 


89.95 
% 


83.4% 


89.8% 



However, in a conventional wafer manufacturing facility, operating all 80 types 
of sampling combinations is difficult. Hence, providing an integrative sampling 
1 5 strategy for the overlay error model of this invention is very important. 
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According to the aforementioned comparison, an integrative overlay sampling 
strategy is provided. The sampling procedure includes: 

(a) determining the state within the intra-field, that is, determining the number 
and positions of the selected overlays within the intra-field for example, selecting four 
positions from (1) - (8) in Fig. 6 and the overlay (9) in the central position serving as the 
sampling state of the intra-field overlay location, and the intra-field overlay sampling 
state, for example, the eight types of intra-field overlay patterns in Fig. 7 as sampling 
state, thereafter, determining the sum of the overlays, for example, 10 to 255, and 
ranking each type of intra-field overlay sampling state similarly on the wafer, and 
measuring the locations and actual overlay error values of each overlay type of the 
sampling states within the intra-field; 

(b) using the measured values and utilizing the least square method to obtain the 
coefficients according to the overlay error model formulas (23) and (24), computing 
related square coefficient values between each type of overlay sampling states within 
the intra-field and the overlay model (or R-square represents the degree of the assessed 
variance that can be explained by the model), selecting the highest related square 
coefficient among the overlay sampling states within the intra-field, for example, the 
intra-field sampling state (d); 

(c) determining a plurality of inter-field overlay sampling patterns, that is, 
determining the locations and number of types of overlay sampling states within the 
inter-field, the location, for example, is in any one of the wafer grids in Fig. 8, the 
number is, for example, between 2 to 51 and the inter-field overlay sampling pattern is 
preferably the 10 types of pattern shown in Figs. 9 to 18, wherein the intra-field state is 
selected according to the highest square coefficient in step (b), for example, the intra- 
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field sampling state (d), and measuring the locations and actual overlay error values of 
each overlay sampling pattern within the inter-field; 

(d) using the measured values in step (c) and utilizing the least square method to 
find the coefficients in the model formulas (23) and (24), and computing related square 
coefficient values between each overlay sampling pattern within the inter-field and the 
overlay model, selecting an inter-field overlay sampling pattern having the highest 
related square coefficients, for example, the inter-field overlay sampling pattern (viii) or 
(x); and 

(e) determining the combination of intra-field overlay sampling state and inter- 
field overlay sampling pattern and its relationship with the model so that the highest 
related coefficients are found, for example, the intra-field sampling state (d) and inter- 
field sampling pattern (viii) or the intra-field sampling state (d) and the inter-field 
sampling pattern (x), performing compensation and modification using the measured 
overlay errors for the same stepper machine so that the number of overlay errors of the 
exposed patterns is reduced to a tolerable range and yield of semiconductor device is 
improved. 

According to the aforementioned overlay error model and overlay error 
sampling strategy, this invention also provides a device for implementing the model and 
the sampling strategy. 

The aforementioned related square coefficient indicates the degree of 
resemblance of the model. This invention uses the variable parameters in the overlay 
model to compensate for any overlay errors. The residual parameters of the intra-field 
sampling state (a) and intra-field sampling state (d) refer to portion of the overlay errors 
that cannot be compensated according to the estimation produced by the model Fig. 
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20 is a bar graph showing residual value between a conventional intra-field overlay 
sampling state (a) and the optimized intra-field overlay sampling state (d) according to 
this invention. As shown in Fig. 20, the residual parameter of the sampling state (d) is 
more concentrated than the conventional sampling state. The overlay norm 
computation is carried out using the residual value produced by 255 overlay, and the 
norm value v is defined as: 



Hence, the calculated value of the sampling state (d) for the overlay error norm 
is 0.598 while the overlay error norm for the sampling state (a) is 0.924. For sampling 
state (1), this indicates that sampling state (d) is able to reduce about 35.4 percent more 
overlay error and the residual analysis of the sampling state (d) also shows the 
effectiveness and advantage of the model provided by this invention. Fig. 21 is a 
graph analyzing the distribution of residual value in the intra-field overlay sampling 
state (d) according to this invention. 

In the measurement of overlay errors, the relationship between the sample 
number and the related square coefficient can be assessed based on the sampling cost 
and the effectiveness of the compensation as shown in Fig. 19. According to the 
current model and actual wafer manufacturing facilities, this invention provides an 
innovative intra-field and inter-field overlay model. Using data from an effective 
estimation to modify overlay errors, a sampling location better than those found by 
conventional methods is obtained. Hence, this invention is capable of reducing 
overlay errors by up to 35.4 percent. 

It will be apparent to those skilled in the art that various modifications and 
variations can be made to the structure of the present invention without departing from 




(25) 
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the scope or spirit of the invention. In view of the foregoing, it is intended that the 
present invention cover modifications and variations of this invention provided they fall 
within the scope of the following claims and their equivalents. 
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